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Summary 


The  robust  analysis  of  linear  models  based  on  R-estimates  involves  an 
estimate  of  a  scale  parameter  which  is  used  in  the  analysis  as  a  standard¬ 
izing  constant.  The  consistency  of  previous  estimates  of  this  scale  para¬ 
meter  required  that  the  underlying  errors  be  symmetrically  distributed.  This 
assumption  is  not  always  warranted,  for  instance  in  survival  models.  A  new 
estimate  is  proposed  for  the  scale  parameter  and  it  is  shown  to  be  consis¬ 
tent  for  nonsymmetric  and  symmetric  error  distributions.  With  this  new  scale 
estimate,  a  complete  robust  analysis  of  a  linear  model  can  be  accomplished 
without  assuming  symmetry.  The  small  sample  properties  of  the  analysis  are 
examined  in  a  Monte  Carlo  study  of  several  different  situations. 


Some  keywords:  general  linear  hypothesis;  linear  model;  R-estimates;  rank 
statistics;  regression;  robust  analysis. 
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1.  Introduction  and  Summary. 


McKean  and  Hettraansperger  (1976,  1978)  proposed  a  robust  analysis  of 
linear  models  based  on  Jaeckel's  (1972)  robust  R-estimate  of  regression 
coefficients.  This  analysis  is  analogous  to  the  least  squares  analysis  of 
variance,  indeed  the  geometry  of  the  two  analyses  are  quite  similar,  McKean 
and  Schrader  (1980).  The  R-analysis  is  also  a  generalization  of  R-proce- 
dures  in  the  simpler  location  problems  and,  in  particular,  it  inherits  their 
asymptotic  relative  efficiency  properties  with  respect  to  least  squares. 

Small  sample  power  studies,  Section  4  and  Hettraansperger  and  McKean  (1983), 
support  these  asymptotic  efficiencies.  The  R-analysis,  thus,  offers  the 
user  a  versatile,  robust  alternative  to  least  squares  for  analyzing  linear 
models . 

The  R-analysis  requires  the  estimation  of  a  scale  parameter,  T  ,  which 
is  used  as  a  standardizing  factor  in  the  associated  R-test  for  a  general 
linear  hypothesis  and  in  the  standard  error  of  an  R-estiraate.  The  estimate 
of  t  proposed  by  McKean  and  Hettraansperger  (1976)  is  consistent  under  the 
assumption  of  symmetrically  distributed  errors.  There  are  many  situations, 
though,  when  the  assumption  of  symmetry  is  unrealistic.  For  example  many  of 
the  parametric  models  used  for  survival  time  have  skewed  distributions,  see 
Chapter  2  of  Miller  (1981).  In  accelerated  failure  time  models,  Chapter  2  of 
kalbfleisch  and  Prentice  (1980),  the  log  of  survivor  time  is  cast  in  a  linear 
model  whose  error  structures  have  skewed  distributions.  Many  of  these  skewed 
distributions  also  have  long  tails  in  which  case  a  robust  analysis  is  espe¬ 
cially  attractive. 

In  this  paper  we  propose  an  estimate  of  x  for  the  case  of  Wilcoxon 
scores.  The  proof  of  its  consistency,  Sections  3  and  5,  does  not  require 
symmetry.  Using  this  estimate  of  scale  the  R-analysis  of  McKean  and 


Het tmansperger  is  asymptotically  valid  for  both  symmetric  and  non-symraetric 
error  distributions.  Further,  as  noted  in  Section  2,  by  utilizing  the  theory 
of  Sievers  (1983)  the  asymptotic  theory  of  the  R-analysis  based  on  Wilcoxon 
scores  can  be  obtained  under  much  milder  regularity  conditions  on  the  design 
matrix  than  those  assumed  by  Jureckova  (1971)  and  Jaeckel  (1972). 

Asymptotic  theory  serves  as  a  useful  guideline  for  the  R-analysis  but 
for  practical  use  an  investigation  of  its  small  sample  properties  is  required 
as  discussed  in  Section  4.  The  estimate  of  T  is  based  on  residuals  and 
certain  small  sample  corrections  are  necessary  for  its  use  in  standardizing 
the  R-test  statistics.  We  present  the  results,  Section  4,  of  a  Monte  Carlo 
study  which  included  both  symmetric  and  skewed  distributions  over  several 
different  designs.  On  the  basis  of  this  study,  the  R-analysis  utilizing  the 

new  estimate  of  r  performed  as  well  as  that  using  the  old  for  the  symmetric 

distributions  and  performed  better  for  the  skewed  distributions.  Both  the 
old  and  the  new  were  robust  with  respect  to  least  squares.  On  the  basis  of 
the  theory  and  this  study,  we  would  recommend  using  the  R-analysis  with  the 
estimate  of  t  found  in  Section  3. 

2.1  Notation  and  assumpt ions . 

Let  Y  be  an  n  x  1  vector  of  observations  which  follows  the  linear 

model 

Y-al+xe+e  (2.1) 

where  a  is  the  intercept  parameter,  1  denotes  an  n  x  1  vector  of  ones, 

g  is  a  p  x  1  vector  of  parameters,  X  is  an  n  x  p  design  matrix,  and  e 

is  an  n  x  1  vector  of  independent  and  identically  distributed  errors  which 
have  the  common  density  function  f(x) .  We  will  assume  that  X  has  full 


r 


column  rank  p  and,  since  Che  model  includes  an  intercept  parameter,  that 
the  column  averages  of  X  are  zero.  We  denote  general  linear  hypotheses 

as, 

Hy :  HB  -  0  versus  H^:  HS  f  0,  (2.2) 

where  H  is  a  q  x  p  matrix  of  full  row  rank. 

Let  ft  denote  the  column  space  of  X.  We  can  express  the  model  (2.1) 
equivalently  as 

Y  «  p  +  e*  ,  U  e  ft  (2.3) 

where  e*  *  al  +  e.  Letting  w  denote  the  (p  -  q)  -dimensional  subspace  of 
ft  constrained  by  HB  *  0,  the  hypotheses  (2.2)  can  then  be  expressed  as 


Hq:  p  e  <d  versus  HA:  p  e.  ft\w  . 

Let  denote  the  projection  matrix  onto  ft. 

The  asymptotic  distribution  theory  discussed  in  this  paper  requires  some 
assumptions  on  the  subspace  ft  and  the  underlying  density  f(x) .  We  will 
consider  sequences  of  subspaces  ft^,  indexed  by  the  sample  size  n,  which 
have  a  common  dimension  p  and  which  satisfy 


1 ira  max  h . .  ■  0  (A.  1 ) 

n  -*»  l<i<n  lln 

where  h. .  is  the  ith  diagonal  element  of  the  projection  matrix  P„  .  For 
irn  r  J  ft 

n 

asymptotic  inference  concerning  B  we  will  further  assume  that  the  sequence 


of  design  matrices  X  satisfies 


1  im  —  X'  X  -  t, 
n  n  n 


L  is  positive  definite. 


(A. 2) 


We  will  often  suppress  the  subscript  n  on  ft  and  X. 
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For  the  density  function  f(x)  we  will  assume  the  following: 


(i) 

f  is  absolutely  continuous; 

(ii) 

/(f'/f)2f  dx  <  »; 

(B.l) 

(iii) 

t  -  I12l/2/f2dx]“1  <  •. 

2.2  R-analysis. 

The  part  of  model  (2,1)  involving  8  is  often  termed  the  deviations 
from  means  model.  Least  squares  inference  for  8  is  based  on  the  squared 
semi-norm 

I Iu| |Lg  m  l< u.  -  u)2,  u  £  Rn,  (2.4) 

which  can  also  be  written  as 

IMlJ  "  R'llli{1i  "  uj>2  (2*5) 

Kj  J 

Bor  Che  Wilcoxon  inference  on  6,  the  squared  differences  in  (2.5)  are 
replaced  by  the  absolute  differences;  Chat  is,  the  R-semi-norm  is  given  by 

||u||  ■  (2(n  +  l)/./l2)£  l\u.  -  u .  |  ,  u  e  Rn.  (2.6) 

i<j  '  J 

The  factor  in  parentheses  is  a  standardization  constant.  The  R-semi-norm 
can  be  expressed  more  familiarly  as 

|  |u|  L  -  OT2/(n  *  D)  l  (R(u .)•.-  ((n  ♦  l)/2))u., 

A  It 

where  R(u^)  is  the  rank  of  u^  among  u^,  ...,  un>  see  Hettmansperger  and 
McKean  (1978). 

The  estimate  of  6  obtained  from  this  serai-norm,  that  is,  a  value  of  8 
which  minimizes  | |Y  -  X8l|R,  is  the  R-estiraate  proposed  by  Jaeckel  (1972) 
for  Wilcoxon  scores.  In  terms  of  Model  (2.3),  McKean  and  Schrader  (1980) 
defined  yR  as  a  best  R-predictor  of  Y  when  yR  minimizes 


- - 


2 


>8^.- 


|  | Y  -  y |  |R,  y  c  ft .  Jaeckel  obtained  the  asymptotic  distribution  of  these 

R-estimates  under  conditions  more  restrictive  than  those  above;  however, 

Sievers  (1983)  showed  that  under  (A.l),  (A. 2),  and  (B.l)  the  distribution  of 
*  2-1 

BR  is  approximately  N(8,  T  (X'X)  )  where  T  is  defined  in  (B.l). 

The  least  squares  test  statistic  of  the  hypotheses  (2.2)  is  based  on  a 
comparison  of  the  squared  distance,  in  terms  of  the  norm  (2.4),  between  Y 


and  each  of  the  subspaces  m  and  ft  .  Analogously  McKean  and  Het tmansperger 
(1976)  proposed  as  an  R-test  statistic 


lly-“JIR-  II? 

(P  -  Oi/2 


(2.8) 


where  y  and  y_  are  best  R-predictors  of  Y  from  the  subspaces  oj  and 
0)  ft 

ft  and  t  is  an  estimate  of  x.  Similar  to  least  squares,  the  numerator  of 
FR  is  a  comparison  of  R-distances  between  Y  and  the  subspaces  at  and  ft  . 
The  test  is  to  reject  in  favor  of  for  large  values  of  FR.  McKean 

and  Het tmansperger  showed  that  under  and  regularity  conditions, 

(p  -  q)FR  has  an  asymptotic  chi-squared  distribution  with  (p  -  q)  degrees 
of  freedom,  provided  x  is  a  consistent  estimate  of  x.  Using  Sievers' 
(1983)  development,  the  conditions  (A.l),  (A. 2),  and  (B.l)  suffice  for  the 
regularity  conditions. 

A  Wald  type  of  test  for  Hq,  based  on  the  full  model  R-estimate,  is  to 
reject  for  large  values  of  the  statistic, 

ChL)'(h(x'x)~1h')"1(hL) 

B  -  - - - - —  (2.9) 

(p  -  q)T 

Unlike  their  least  squares  counterparts  the  R-statistics  B  and  F  are 
not  algebraically  equal;  however  they  have  the  same  asymptotic  null  distri¬ 
bution  and  asymptotic  relative  efficiency,  see  Hettmansperger  and  McKean 
(1983).  The  statistic  B  is  similar  to  Bickel's  (1976)  test  based  upon 
pseudo-observations  obtained  from  an  M-estimate  of  8. 


Although  the  asymptotic  theory  suggests  chi-squared  critical  values, 
from  previous  Monte  Carlo  studies  we  have  found  it  best  to  compare  the  sta¬ 
tistics  Fr  and  B  with  F-critical  values  having  p  -  q  and  n  -  p 
degrees  of  freedom.  This  is  discussed  further  in  Section  4. 

3.  Estimate  of  x . 

1 

In  order  to  use  the  R-analysis,  the  scale  parameter  T,  (B.l),  needs 
to  be  estimated.  In  the  simple  location  problem  under  the  additional  assump¬ 
tion  of  a  symmetric  distribution,  Lehmann  (1963)  proposed  a  consistent  esti¬ 
mate  of  T  based  on  the  length  of  a  distribution  free  confidence  interval 
for  the  location  parameter.  For  the  R-analysis  of  linear  models,  McKean  and 
Hettmansperger  (1976,  1978)  proposed  using  Lehmann's  estimate  of  x  computed 
from  the  residuals.  Under  the  assumption  of  symmetrically  distributed  errors 
they  showed  the  estimate  was  consistent.  For  the  asymptotic  distribution 
theory  of  the  R-analysis,  though,  this  was  the  Only  place  where  symmetry  was 
invoked.  In  this  section  we  propose  a  new  estimate  of  x  and  show  that  it 
is  consistent  under  the  assumptions  (A.l),  (A. 2),  and  (B.l).  The  R-analysis 
of  Section  2.2  can  thus  be  employed  for  both  symmetrically  and  nonsymraet r i- 
cally  distributed  errors. 

We  find  it  more  convenient  to  work  with  the  parameter  y  defined  by 

y  -  (/IT  x)"1  -  If2  '  (3.1) 

Note  that  the  difference  of  a  pair  of  error  random  variables, 
e^*  -  e^*  *  e^  -  e^,  has  as  its  density, 

g(t)  -  / f(x)f(x  -  t)dx.  (3.2) 


This  suggests  estimating  Y  with  an  estimate  of  g(0) . 
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Lee  Z  denote  the  vector  of  R-residuals, 

Z  -  Y  -  y_  (3.3) 

K 

A  A 

where  ■  Xg^  is  a  best  R-predictor  of  Y  from  ft.  We  are  interested  in 
the  differences  of  the  residuals,  Z.  -  Z^ ,  and  since  these  are  symmetric 

about  0,  we  need  only  consider  the  set  of  |Z^  -Zj |  for  i  <  j.  The 

corresponding  empirical  distribution  function  is 

G  (t)  -  (I?)'1  l  $<|Z.  ~  Z.\,  t)  (3.4) 

n  2  .  £ .  *  1  i  l 1 

Kj 

where  $(u,v)  *  1  or  0  if  u  <  v  or  u  >  v.  Our  estimate  of  y  is  a 

measure  of  the  slope  of  Gn  at  0,  namely 

Y  -  G  (t  /Yn) / ( 2 1  /Yn)  (3.5) 

n  a  a 

A  A 

where  is  the  ath  quantile  of  G^(t).  The  corresponding  estimate  of  t 

is 

X  *  (/I2Y)"1.  (3.6) 

Some  discussion  on  the  choice  of  o  can  be  found  in  Section  4. 

The  consistency  of  x  is  noted  in  the  following  theorem  which  is 
proved  in  more  generality  in  Section  5. 

Theorem  3.1.  Consider  a  sequence  of  linear  models  which  satisfies  assump- 

•>  p 

tions  (A.l),  (A. 2),  and  (B.l).  For  0<a<  1 ,  x  -*•  x  . 

If  t  is  estimated  by  T  of  (3.6)  then  the  R-analysis  discussed  in 
Section  2.2  is  asymptotically  valid  for  both  symmetric  and  non-symmetr ic 
error  distributions.  It  can  thus  be  used  for  instance  in  the  applications 
mentioned  in  Section  1.  The  small  sample  properties  of  x  are  discussed  in 


the  next  section. 
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The  algorithm  we  employ  to  compute  Y  is  an  iterative  procedure  which 

exploits  the  relationship  between  G^  and  the  Wilcoxon  two  sample  process 

defined  below.  This  procedure  avoids  the  storage  and  sorting  of  the  (”) 
differences,  requiring  only  vectors  of  length  2n. 

In  order  to  define  the  two  sample  process  consider  as  one  sample  the 
residuals  and  as  a  second  sample  exactly  the  same  set  of 

residuals.  The  Wilcoxon  process  is 

n 

W  (t)  «  l  R(Z.  -  t)  -  n(n  +  l)/2 
n  .  .  i 

i“l 

where  R(Z.  -  t)  denotes  the  rank  of  Z.  -  t  among 

Z.  -  t,...,Z  -  t,  Z.,...,Z  .  For  t  >  0,  it  can  be  sho  ,.nat 

i  n  i  n 

G  (t)  -  1  -  0_1W  (t). 
n  2  n 

The  quantile  t  can  be  obtained  by  solving  the  equation  1  -(”)  *W  (t  )  =  a, 
which  can  be  solved  iteratively  using  a  modified  version  of  regular  falsi 
similar  to  the  algorithm  presented  by  McKean  and  Ryan  (1977). 

In  the  simple  location  problem  Schweder  (1975)  proposed  an  estimate  of  y 
which  is  basically  a  window  estimate  and  he  recommended  a  uniform  window  sym¬ 
metric  about  zero.  Assuming  p.  *  0  for  all  i  so  that  no  nuisance  parame¬ 
ters  need  be  estimated,  his  estimate  is 

7  *( |Y.  -  Y. j  ,  h  /2)/n2h 
■  ^  .y  j  i  '  n  n 

i.J 

-  ((n  -  l)/n)G  (h  / 2 ) / h  +  (1/nh  ) 

n  n  n  n 

*  G  (h  /2)/h 
n  n  n 


where  h  is  the  width  of  a  uniform  window.  The  estimate  (3.5)  is  seen  to 
n 

-1/2 

be  similar  to  this  type  of  estimate  with  a  window  width  of  0  (n  ).  In 

P 

proving  consistency  and  asymptotic  normality,  Schweder  assumed  the  window 

/—  ^  —I/O 

n  h^  -*■  ®,  ruling  out  the  0(n  )  cases,  and  recommended 


i 
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0(n 


■1/3, 


n 

Our  estimate  y  differs  from  Srh-.t-aer's  estimate  by  allowing  for  the 
estimation  of  nuisance  parameters  in  using  residuals  rather  than  iid  vari¬ 
ables.  Also  it  uses  a  window  width  estimated  from  the  data.  In  an  actual 
application  of  Schweder's  estimate  the  choice  of  an  appropriate  s  for 

h  ■  sn  is  difficult  to  make  and  his  Monte  Carlo  study  indicates  chat 

n 

the  bias  of  the  estimate  is  sensitive  to  the  choice  made. 


4.  Small  Sample  Properties 

In  this  section  we  discuss  a  Monte  Carlo  study  of  the  performance  of  i  on 

F  and  B  in  several  small  sample  cases  chat  reflect  problems  arising  in 

practice.  The  results  show  that  some  modification  is  necessary  to  obtain 

satisfactory  results.  After  some  small  sample  corrections  are  made,  the  new 

R-method  performs  quite  well.  It  is  a  definite  improvement  over  the  old 

R-method  for  nonsymmetric  distr ibut ions  and  is  superior  to  least  squares 

methods  in  maintaining  level  and  power  for  nonnormal  distributions. 

The  performance  of  t  as  an  estimate  of  t  can  be  studied  by  examining 

bias  and  mean  square  error.  Instead  of  this  we  chose  to  examine  directly  how 

well  t  standardizes  the  test  statistics  F  and  B.  The  inclusion  of  B 

R 

in  Che  study  gives  information  on  how  well  T  performs  for  confidence  inter¬ 
vals  since  confidence  regions  for  B  based  on  B  -.  can  be  obtained  by 

R 

inverting  the  test  statistic  B  for  appropriate  H  matrices. 

The  scale  estimate  *  is  based  on  residuals  which  tend  to  be  less  vari¬ 
able  chan  independent  observations.  This  effect  diminishes  for  large  sample 
sizes,  but  for  smaller  sample  sizes  the  dependence  among  residuals  may  cause  t 
to  regularly  underestimate  T.  An  indication  of  this  effect  is  shown  in 
Figure  4.1.  This  figure  shows  graphs  of  the  empirical  distribution  functions 
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functions  of  the  differences  |e.  -  e.l  of  random  errors  and  of  the 

1  i  j' 

differences  of  R-residuals  |Z^  -  |  for  one  simulation  of  lognormal 

variables  in  design  A  below.  This  figure  is  typical  of  many  others  we  have 
examined.  Note  that  the  differences  of  R-residuals  are  stochastically 
smaller  chan  the  differences  of  Che  independent  observations.  As  a  result, 
the  estimate  of  slope  y  is  larger  for  residuals  than  for  actual  errors  and 
the  reciprocal  is  then  biased  downward. 

One  of  the  more  successful  corrections  we  considered  was  the  standard 
least  squares  correction,  namely, 

x  *  x(n/(n  -  p))*^.  (4.1) 

As  noted  in  Figure  4.1,  this  has  the  effect  of  increasing  the  horizontal 
coordinate  used  in  the  slope  estimate  and  thus  decreasing  the  slope. 

Another  small  sample  correction  that  improved  the  empirical  results  is 
to  use  a  critical  value  from  the  F  distribution  with  p  -  q  and  n  -  p 
degrees  of  freedom  instead  of  a  chi-square  critical  point.  The  R-analysis 
then  uses  the  same  critical  values  as  the  least  squares  analysis. 

4.2  Monte  Carlo  Study 

The  empirical  study  here  considers  the  tests  based  on  F  and  B  with 
t*  of  (4.1)  used  in  the  denominator.  Results  are  reported  here  for  t* 
based  on  quantiles  ct  =  .80  and  .90  with  the  corresponding  test  statistics 
denoted  FR(a)  and  B(a).  The  rank  statistic  of  hettmansperger  and  McKean 
(1976,  1978),  which  is  appropriate  for  symmetric  error  distributions,  is  also 
included  and  denoted  by  Fq.  It  uses  the  numerator  of  F^  standardized  by 
Lehmann's  estimate  of  x  (see  section  3),  The  specific  estimate  we  used  is 
given  in  formula  (2.7)  of  Hettmansperger  and  McKean  (1978)  based  on  an  80% 
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confidence  level  and  with  che  same  small  sample  correction  as  expressed  in 
(4.1)  here.  The  least  squares  analysis,  denoted  by  LS ,  is  also  included  to 
provide  a  reference  point  and  show  the  potential  gain  in  using  the 
R-analysis  instead  of  the  classical  analysis. 

The  R-analyses  were  computed  by  using  the  k-step  R-estimates  with 
k  *  2.  McKean  and  Hettmansperger  (1978)  noted  the  close  agreement  between 
the  analyses  based  on  the  2-step  estimates  and  those  based  on  the  fully 
iterated  estimates,  which  we  have  confirmed  in  a  small  pilot  study.  The 
analyses  based  on  the  k-step  estimates  have  the  same  asymptotic  properties 
while  taking  much  less  time  to  compute.  The  M-estimates  of  Huber  (1973) 
were  used  as  starting  values.  The  estimates  of  t  were  computed  by  an 
iterative  routine  as  noted  in  Section  3.  The  actual  algorithms  used  are  des¬ 
cribed  in  Hettmansperger  and  McKean  (1983). 

We  examined  che  behavior  of  the  tests  over  three  designs.  One  was  the 
same  unbalanced  2x2  design  considered  by  McKean  and  Hettmansperger  (1978) 
which  has  cell  sizes  of  n^  *  n ^  *  &  and  n^  *  n2l  *  We  considered 

the  three  hypotheses:  hqi’  average  row  effects  the  same;  hq2 '  average 
column  effects  the  same;  and  H^^ ,  no  iteraction.  For  this  design,  which 
we  label  A,  the  sample  size  is  26,  p  =  4,  and  p  -  q  =  1  for  all  three 
hypotheses , 


The  other  two  designs  each  involve  two  regression  lines.  The  design 
matrices  are  of  the  form, 


*-  (!  ? 

X 

X 

°) 

• 

For  design  B  the  1 

denotes  a  vector 

of 

10 

ones 

and  x 

is 

the  vector 

(-l,-.8,...,-.2,.2,.. 

. , 1 ) ' .  Design  C 

was 

a 

larger 

vers  ion 

of 

B,  with  the 

x  vector  consisting  of  the  20  values,  (-1,  ~.9,...,.l,  ,1 . 1)'.  For 

each  design  we  considered  the  three  hypotheses:  same  intercept;  Hq2’ 


same  slope;  and  same  regression  line.  Thus  the  sample  sizes  for  the 

designs  are  20  and  40,  p  is  4  for  both  designs,  and  p  -  q  is  1  for 

the  first  two  hypotheses  and  2  for  the  third  hypothesis. 

Four  error  distributions  were  considered  in  this  study.  Two  of  the  dis¬ 
tributions  were  symmetric:  the  standard  normal  (N)  with  distribution  func¬ 
tion  #(x)  and  a  contaminated  normal  (CN)  with  distribution  function 
F(x)  ■  .85  $  Cx)  +  .15<t(x/7).  The  other  two  distributions  were  skewed:  a 

lognormal  (L)  with  distribution  function  F(x)  *  $(log  x)  and  a  skewed 
contaminated  normal  (SCN)  with  distribution  function 

Fix)  *  .82  $ Cx)  +  .18$((x  -  1.9)/13.5).  The  last  is  a  moderately  skewed  dis¬ 
tribution  proposed  by  Draper  (1981). 

The  normal  variates  were  obtained  by  the  transformation  on  a  pair  of 
uniforms  as  proposed  by  Marsaglia  and  Bray  (1964)  while  the  uniforms  were 
generated  by  the  algorithm  UNI  developed  by  Gross  (1979). 

For  each  situation,  a  design  and  a  distribut ion,  we  ran  1000  simula¬ 
tions.  Both  empirical  levels  and  powers  were  investigated.  Vie  chose  alter¬ 
natives  to  give,  for  the  most  part,  a  spectrum  of  the  power  curves. 

4.3.  Results. 

We  will  summarize  our  results  for  the  tests  in  terms  of  empirical  levels 
and  powers  for  three  alternatives.  For  a  nominal  -.05  level,  the  empirical 
levels  are  recorded  in  Table  4.1  and  the  powers  are  in  Tables  4.2  -  4.5.  The 
results  for  nominal  levels  .01  and  .10  were  similar.  Since  the  results 
are  based  on  1000  simulations,  two  standard  errors  for  the  levels  is  about 
.014.  In  Table  4.1  a  plus  indicates  that  the  value  is  above  .05  by  two 
standard  errors  while  a  minus  indicates  chat  it  is  below. 
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The  effect  of  a  on  the  estimate  of  r  is  apparent  from  Table  4.1. 

The  levels  for  a  *  .90  are  generally  much  more  conservative  than  those  for 
o  *  .80.  Although  not  tabled,  we  found  that  this  trend  continued  for  a  *  .95. 
The  choice  of  a  seems  important.  In  this  study  the  value  of  a  m  .80 
produced  the  best  results.  Note  that  the  Wald  type  test  is  slightly  more 
liberal,  at  a  ■  .80,  than  the  drop  in  dispersion  test.  The  comparison  of 
interest  is  between  Dq  and  FR(.3).  Their  empirical  levels  for  the 
symmetric  distributions  are  quite  close.  This  is  true  for  the  skewed  con¬ 
taminated  normal;  however,  in  the  case  of  the  lognormal  the  levels  of  are 
much  smaller. 

For  the  empirical  powers  we  only  tabled  the  results,  of  the  procedures 
based  on  a  ■  .80.  The  table  index  for  the  alternatives  is  the  parameter 
X  m  2  ^ (Hg) ' (H(X' X)  *H')  *Hg  where  8  is  the  vector  of  alternatives.  Hence  A 
is  the  non-centrality  parameter  for  the  least  squares  F-test  at  the  standard 
normal . 

For  the  normal  distribution,  least  squares  dominates,  as  it  theoretical¬ 
ly  should,  over  all  the  R-procedures  but  not  by  that  much.  The  R-proce- 
dures  essentially  have  the  same  power,  among  themselves.  The  reverse  is  true 
for  the  contaminated  normals  where  all  the  R-procedures  are  substantially 
more  powerful  than  least  squares  over  all  the  designs.  Among  the  R-proce- 
dures ,  perhaps  D^  has  a  slight  edge.  The  same  is  true  for  the  skewed 
contaminated  normals  except  Chat  the  R-procedures  based  on  the  new  estimate 
of  t  behave  slightly  better.  For  the  lognormal  distribution  the  new 
R-methods  are  more  powerful  than  over  all  the  designs.  In  this  case  the 

procedure  tends  to  be  conservative,  especially  in  the  large  Design  C; 

however,  even  here  it  is  more  jxjwerful  than  least  squares. 
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5.  Consistency  for  the  Scale  Estimate 

We  will  consider  a  more  general  estimate  of  T  based  upon  a  weighted 
empirical  distribution  function  given  by 


H  (t)  -  7  a..*(|Z.  -  2.1,  t) 

n  .“.  ij  1  l  i 1 

i<j 


(5.1) 


where  Z  is  the  vector  of  R-residuals  given  by  (3.3)  and  the  weights 
are  non-negative,  £a^  *  1,  and  satisfy 


2  r  2  _  _  2 


n  >  a. .  < 

L  ij  ~ 


(5.2) 


for  some  constant  B^.  The  unweighted  case,  a.^  ■  (^)  ,  provides  the 

estimate  of  t  found  in  Section  3.  Other  weights  can  be  used  for  special 
purposes.  For  instance,  in  analysis  of  variance  models  and  stratified  sam¬ 
pling  models  where  the  data  arise  naturally  in  groups,  the  use  of  weights  a^ 
*  0  if  Z^,  Zj  are  from  different  groups  would  leave  the  estimate  dependent 
only  on  the  within-group  variation  of  the  data.  Estimates  with  this  feature 
have  been  proposed  by  Draper  (1981).  As  in  Section  3,  for  0  <  a  <  1  define 
an  estimate  of  Y  by 


v  »  H  (t  //H)/( 2  t  //5)  (5.3) 

'no  no  o 

«  * 

where  t  is  the  ath  quantile  of  H  (t). 
a  n 

Consider  a  sequence  of  models  Y  *  p  ♦  e  "where  y  e  fl  ,  fl 
^  n  n  n  n  n  n 

satisfies  the  assumptions  (A.l)  and  (A. 2),  and  the  vector  of  errors  e^ 

satisfies  (B.l).  Since  the  estimator  y  is  translation  invar iant .without 

n 

loss  of  generality  we  will  assume  that  the  true  value  of  ^  is  0.  These 
assumptions  will  hold  for  the  remainder  of  the  paper. 

In  order  to  facilitate  the  following  proofs  consider  the  subset  of 


I 
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ft  defined  by 
n 


Mn  -  {y  e  ftn:  |  Jy|  |  _<  A}, 


(5.4) 


where  A  >  0  is  a  given  constant.  By  assumption  (A.l)  it  follows  that, 


lim  sup  max  |y.  -  y.  | 

n-*»  jj  gm  l<i<j<n  Jn  10 


(5.5) 


n  n 


For  5  >  0  we  construct  a  covering  M.  ,  . . .  ,  of  M  as  follows: 

i  r  Kn  n 

Let  C  be  an  n  x  p  matrix  whose  columns  form  an  orthonormal  basis  for  ft 
n  n 

Let  D  *  {0  e  R*5:  ||0||  A),  where  A  is  given  in  (5.4).  Since  D  is 

compact  we  can  find  an  integer  K  along  with  points  6^ ,  ...  ,0R  in  D  such 

that  DCyD  where  D,  ■  (6:  I  I  0  —  0,  I  I  <  5}.  Defining  M,  *  C  (D,  )  for 
.  k  *  11  °  kn  n  k 


k 


k  “  1 ,  . ..,  K,  it  follows  that  M  C  U  M.  •  For  any  such  covering  of  M 

n  k  Kn  n 

let 


dijkn  ’  SUP *  >V  K0  *«*„> 

dijkn  '  inf(),jn  ‘  "l„:  "n  e  "kn  >• 


(5.6) 


where  k  ■  1,  ...,  K  and  1  <_  i  <  j  <_  n.  We  need  the  following  inequality: 


Lemma  5.1.  For  &  >  0  let  M 


In’ 


,  be  the  associated  covering  of 


M  .  Then  for  k 
n 


1 ,  •  ■  • ,  K. 

-1  V 

n  l  <di 


Kj 


ijkn  dijkn 


2  2 
)  <.  2p6;  . 


Proof: 


Let  c.  denote  the  ith  row  of  the  matrix  C  . 
in  n 


Then 


,U 


dV.  ■  sup{(c.  ■  c.  ) 1  9:  0  e  D.  } .  Since  D,  is  compact  we  have  d..,  * 

ijkn  jn  in  k  k  v  ijkn 

(c.  -  c.  )'©¥.,  for  some  0.H,  e  D,  .  Similarly  dK.  *  (c.  -  c.  )'9f.t 

jn  m  ijk  ijk  k  J  ijkn  jn  in  ijk 

for  some  8^^  e  By  the  Cauchy-Schwarz  inequality  we  then  obtain 
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( 1/n)  l  (dV..  -  d*\.  )2  <  ( 1/n)  l  ||c.  -  c.  | |2| | e“. 

ijkn  ijkn  —  . f; . M  jn  inM  1  1  ijk 


£■1 

Define  the  process 


<  2<S^  >  (1/n)  J  ( c . 


KJ 

2  2 

-  c.„  )  *  2d  p. 


^kN2 


j£n  i£n 


Rn(t*  V  V  "  Hn<t/j^»  V  Yn)  “  2cY> 

where  H  (t/./n,  n  ,  Y  )  *  J  a.  .  <p  (  |  ( Y .  -  q-  (Y.  ~  n-  )|,  t/Vn)  . 

n  ti*  n  ij  y  1  jn  ]n  in  T.n  1  ’ 

Lemma  5.2.  Under  Che  above  assumptions,  for  q^  e  and  t  >  0, 

p 

R  (t,  q  ,  Y  )  -►  0  as  n  -»■  ®. 
n  ’  'n  n 


Proof: 


Rn^c’nn*Yn^  ”  ^  l  a.  CW.  .  -  2ty/>/n) 
n  n  n  ^ 

where  *  d> C|  Yj  -  Yi“  dy  |  ,  t//n)  and  d^  -  -  nin  .  Using 

E(Wtj)  *  G(dij  +  t/ Jn)  -  GW^-t/t'n) 

■  g(5i^)(2t/*^T) 

where  -  d^  |  <  t/Sii  and  y  -  g(0)  ,  it  follows  that 

E(R .  (t,n,Y  ))  -  2t  l  a..(g(.5.,)  -  g(0))  . 
n  n  n  ^  ij  1J 


By  (5.2),  (5.5),  and  the  continuity  assumed  for  g,  E(R  (t,  n  ,  Y  ) )  0 

as  n  -*>  “. 

Using  assumption  (5.2)  and  |W^  |  1 ,  a  standard  argument  shows  that 

var(R  (t,  n  ,  Y  ))  -*■  0  as  n  ».  With  the  mean  and  variance  tending  to 
n  n  n 

zero  the  lemma  follows. 


This  lemma  shows  that  /n  H  (t/fn,  h  ,  Y  ) / 2 1  converges  in 

n  n  n  ° 

probability  to  y  but  the  result  is  not  strong  enough  since  an  appropriate 
t  must  be  chosen  to  fit  the  data  and  the  nuisance  location  parameter  1 

n 

must  be  estimated.  The  following  theorem  shows  that  the  convergence  in  the 
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lensna  is  uniform  and  this  will  allow  the  construction  of  a  suitable  estimate 
of 


Theorem  5.1.  Let  tg  and  A.  be  positive  numbers.  Then 

sup  lR(t,  n,Y)|^-0  as  n^“. 
w  n  ’  n  n  1 


n  eM  ,  0<t<t 
n  n - 

Proof  of  Theorem  5.1 


This  proof  is  somewhat  long  and  it  will  be  divided  into  pieces  with  some 

lemmas.  Begin  by  choosing  any  e  >  0.  Choose  <$  >  0  sufficiently  small  so 

that  b/Tp  g(0)Bg  6<  e/6,  *  here  Bq  is  specified  in  (5.2).  For  this  6 

form  the  coverings  M,  ,  k  *  1,...,K  and  the  extreme  differences  dV.,  and 

Kn  i  jkn 

dVjkn  (5.6).  Choose  6'  >  0  so  that  6g(0)d'  <  e/6.  Partition  the 
interval  T  *  [0,  tg]  into  subintervals  T^,...,T  of  width  less  than  <5'  . 
For  each  m  *  1 , . . . ,M  let 

Ur  .  L 

t  ■  suplt:  t  e  T  }  and  t  ■  xnflt:  t  e  T  }. 
m  r  m  m  m 


Then  for  each  m  ■  l , . 


(5.7) 


For  each  k  ■  1 . K  and  m  ■  1,...,M,  define 


S . . (k,m)  * 
ij 


1,  if  Y.  -  Y.  e  (dL..  -  tU//n,  dU.u  -  tL//^i) 

j  x  ljkn  m  .  xjkn  m 

or  Y.  -  Y.  e  (dV..  ♦  tL//n,dU.  ♦  tU//n) 
j  x  xjk  m  xjk  m 

0,  otherwise 


for  l<i<j<n  and  Q  *  ^tT£a..S..(k,m). 
—  —  k,m  xj  lj 


Lemma  5 . 3  For  each  k  ■  1,...,K  and  m  ■  1 , . . . ,M 


sup  |R  (t,  H,  Y) 
t.t'eT  " 


-  R  (t\  n\  Y)|  <  q  ♦  zr«' 

n  •  Ktn 


Proof:  Fix  k  and  in  and  write 


R  (t,  n.,  Y)  -  R  ( c  *  ,  rv'  ,  Y)  -  Vn  £  a.  .(W.  .  -  w!  .)  -  2y(t  -  t’) 

where  U  -  *(|Yj  -  Y.  -  d.  | ,  t//T) ,  W*.  -  *(  |Y  -  Y.  -d!  .  |  ,  1 7  -6) , 

d..  ■  n.  -  n.  and  d!  .  ■  n'.  “  n!  .  Inspection  of  possible  cases  shows 

ij  jn  in  ij  jn  in  r 

that  |VT.  -  W'  |  <  S„(k,m)  for  all  1  _<  i  <  j  _<  n,  all  n,  n'  e  M^n  and 

1J  ij  1J  " 

all  t,  t'  e  T  .  Then  the  lemma  follows, 
m 

Since  g  is  continuous  at  0,  there  is  a  5"  >  0  such  chat  |d|  <  6" 
implies  g(d)  <  2  g(0) .  Then  from  (5.5)  there  exists  an  integer  such 

that 


(-6*72)  <  dL.,  <  dU.,  <  (572) 
—  ljkn—  ijkn  — 


(5.8) 


forall  k  ■  1 , . . .  ,K,  all  1  ^  i  <  j  ^  n  and  all  n  _> 


Lemma  5.4.  For  all  k  ■  1 , . . . ,K  and  m  *  1,...,M, 


(a)  there  exists  an  integer  such  that 

E(Q^  b)  (e/6)  +  4  g(0)6'  for  all  n  >  ,  and 

(b)  var(Q.  )  +  0  as  n  ®. 

k,m 


Proo  f : 
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E(Vm>  ■  ^  £  aij  E< Vk-m» 

" 15  J,  V'G(d«k„-^>  - 

+  wwykn*#1®  - 

■  *  Jj  V8(Vk-”>)IdUk„-dW(‘^)''5) 

+  8<V1J  <k-*))Id«kn-dljkn+  (t!- 

"here  dljkn~  ^  i  ««»•■)  1  ijjkn-  and  dkJk[>+  tk/^T  < 

v^Ck.m)  <  d“jkn+  t“//n  for  all  1M  <];n. 

Now  there  is  an  integer  N2  >  Nj  such  that  tQ / ^rT  <  6"/ 2  for  all 
n  L  N2*  Th*3  along  with  (5.8)  implies 


-6"  <  5.j(k,m)  <  5"  and  <  Vlj(k,ml  <  &" 

for  all  1  i  <  j  n  and  all  n  >_  N^.  Then  by  the  choice  of  &"  it 
follows  that 


E(Q  )  <  A  4 
k.ra  — 


g(0)  l 
I  4  g(0)  /n  l 


r.U  .L 

a.  .  Id  .  -  d  . 

ij  tjkn  vjkn 

,.U  L 

L<  j  »-J  iJ^n  ijkn 


+  (t 


m 


-  t 


m 


)/Vn) 


]  +  4  g(0)d’ 


from  (5.7)  for  all  n^N^.  Applying  the  Cauchy-Schwarz  inequality  and  Lemma 
5.1  and  5.2  yields  part  (a). 

The  second  part  of  this  lemma  follows  from  Lemma  5.1  and  the  fact  that 
|Sij(k,m)|  1  with  a  standard  variance  argument. 


Choose  an  arbitrary  e  '  >  0.  For  each  k  -  1 . K  and 

a  “  1  *  •••«  M  choose  a  sequence  n.^  e  M^i  n  "  1>  2,  ...  and  a  number 

t  e  T  . 

m  m 


4 


/ 
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Lemma  5.5.  There  exists  an  integer  such  that 

P( ‘R  (t  ,  nk,Y  ))  >  e/3)  <  e*/2KM 

i  m'  n  n  '  — 

for  all  k  *  1 . y. ,  all  a  m  1 . M  and  all  n  >_  N^. 

Proof:  Apply  Lemma  3.2  for  each  of  the  finite  number  of  pairs  (k,ra). 

Wien  these  preliminaries  completed,  the  proof  of  the  theorem  will  now  be 
completed.  For  each  k  •  1,  ....  K  and  m  *  1,  .  M  write 


P(  sup  (R  (t.n  ,Y  ) |  > e) 

teT  ,n  eft  n  n  n  ~ 
m  n  ten 


P( 


sup  | Rn C t , n  ,Y  )  -  R  (t  ,n\  Y  )|  +  |R  (t  ,nk,Y  ) |  > e) 

.  _<m  _  vi  n  tin  n  m  n  n  n  m  n  n 

t  cl  ,  n_  eft 


n  kn 


-  P(Qk  +2Y5'  +  (R  (t  Y)|  >e) 
K,m  n  m,  n  n  — 


•  P(V»-E<W+E(<W  +  2Y{'+  WV  V  i 


±  P<V«,  -  E«k,»)  I  k/3)  +  P(  |R„(V  nR.Y0)|  i  c/3) 


<  (9/e  )  Var(Q  )  +  CeV2KM)  , 

K.,m 


<  e’/KM 


where  Lemma  5.3  was  used  in  the  third  line;  Lemma  5.4  (a)  and  the  choice  of 
5'  were  used  in  the  fifth  line;  Chebvshev's  inequality  and  Lemma  5.5  were 
used  in  the  seventh  line;  and  the  last  line  follows  from  Lemma  5.4  (b)  since 
the  first  term  in  the  seventh  line  above  can  be  made  less  than  e'/2KM  for 


/ 
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all  k,m  for  a  sufficiently  large,  say  for  n  >_  >_  .  Finally, 


P(  sup 

C— t0’nneMn 


IV^vV 


>  e) 


M  K 

<  l  l  p(  sup  |R  (t.n-  Y  )|  >e) 

m-1  k-1 


M  K 

1  l  l  e’/KM  -  e* 
m-1  k»l 


for  all  n  >_  and  this  completes  the  proof. 

This  leads  to  our  final  result  for  the  parameter  x.  Define 

x  “  l/12y  )  where  y  is  defined  in  (5.3). 

no  no  '  na 

A  P 

Theorem  5.2.  Under  the  assumptions  (A.l),  (A. 2),  and  (B.l),  x  -ex, 
-  na 

f  o  r  0  <  a  <  1 . 

Proof:  Assuming  condition  (A. 2)  holds,  first  note  that  (A. 2)  is  equivalent 
to  the  condition, 

1  im  max  |  x . ^  |  / y'n  ■  0 ,  for  k  “  1 ,  . . . ,  p , 
n-*»  lf,i_n 

where  x.,  is  the  (i,k)th  element  of  any  basis  matrix  X  for  ft  .  It 
ik  J  n 

a 

then  follows  from  Sievers  (1983)  that  |  []Jn|  |  is  bounded  in  probability 

where  U  £  ft  is  the  best  R-predictor  of  Y  .  This  can  be  used  to  select 

a  A  >  0  so  that  P(  |  |p  |  |  >  A)  is  arbitrarily  small  for  sufficiently  large 

A  P 

n.  From  Sievers  (1982),  t  t  as  n »  where  t  is  a  positive 

«  o  a 

constant.  Let  tQ  of  Theorem  5.1  be  a  number  larger  than  t  .  Then  by 

a  a 

Theorems. 1,  |R  (t  ,  p  ,  Y  )  I  •+•  0  in  probability  and  the  conclusion 
n  na  n  n 


follows. 
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Theorem  3.1  is  the  special  case  of  Theorem  3.2  when 


a .  . 
J-J 


(V1 
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TABLE 

4.1 

EMPIRICAL 

05 

LEVELS 

A. 

UNBALANCED  2  x 

2 

DESIGN 

ERROR  DISTRIBUTION 

NORMAL 

CN 

SCN 

LOGNORMAL 

HYPOTH 

HYPOTH 

HYPOTH 

HYPOTH 

(1) 

(2) 

(3) 

(1) 

(2) 

(3) 

(1) 

(2) 

(3) 

(1) 

(2) 

(3) 

LS 

044 

035" 

059 

026 

035 

041 

056 

051 

042 

045 

066+ 

039 

Do 

038 

036 

056 

044 

046 

042 

050 

054 

050 

034 

055 

042 

D(  .8) 

042 

036 

063 

041 

042 

040 

049 

059 

051 

050 

075  + 

054 

D(  .9) 

039 

035" 

060 

032 

035“ 

029~ 

047 

057 

048 

040 

065+ 

042 

B(  .8) 

046 

030~ 

064 

042 

043 

044 

052 

065+ 

061 

057 

087  + 

061 

B(  .9) 

044 

032~ 
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026 

032" 

028~ 

046 

054 

052 

038 

065+ 

043 

B .  SMALL 

PARALLEL 

DESIGN 
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(3) 
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(1) 
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057 

044 
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024~ 
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040 
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038 

Do 
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045 

040 

038 
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041 

043 

038 

048 

040 

042 

D(  .  8 ) 

056 

046 

045 

036 

038 

028" 

040 

047 

041 

060 

045 

051 

D(  .9) 

051 

040 

043 

028~ 

024" 

019" 

039 

036 

034" 

046 

034" 

033 

B(  .8) 

056 

048 

059 

033~ 

040 

041 

047 

052 

050 

067  + 

050 

073 

B(  .9) 

050 

040 

048 

025" 

026' 

024' 

033" 

035' 

027' 

044 

030" 

046 

C .  LARGE 

PARALLEL 

DESIGN 

(1) 

(2) 

(3) 

(1) 

(2) 

(3) 

(1) 

(2) 

(3) 

(1) 

(2) 

(3) 

LS 

053 

043 

057 

033~ 

033 

'  029“ 

045 

046 

045 

036 

045 

03b 

Do 

054 

044 

056 

050 

048 

046 

047 

048 

040 

03o" 

039 

027 

D(  .8) 
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047 

056 

048 

046 

044 

048 

050 

042 

050 

064 
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D(  .9) 
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053 
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035" 
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046 

040 

042 

046 
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047 
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039 

028' 

038 

043 

037 

037 

047 
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Table  4.2  EMPIRICAL  POWERS:  Normal  Distribution 
A.  UNBALANCED  2x2  DESIGN 


A 

0.0 

1.3 

3.0 

6.9 

0.0 

1.3 

3.0 

6.9 

0.0 

1.7 

3.4 

10. 

LS 

044 

346 

642 

950 

035 

335 

645 

951 

059 

421 

695 

985 

Do 

038 

324 

608 

926 

036 

315 

599 

939 

056 

390 

655 

979 

D(  .8) 

042 

324 

600 

924 

036 

320 

601 

938 

063 

383 

657 

978 

B(  .8) 

046 

322 

591 

913 

030 

318 

583 

931 

064 

376 

641 

976 

B.  SMALL  PARALLEL  DESIGN 


H0 1  H02 


X 

0.0 

2.0 

3.2 

7.3 

0.0 

1.9 

3.3 

7.3 

0.0 

.9 

1.6 

3.4 

LS 

057 

464 

650 

938 

044 

466 

666 

947 

053 

180 

304 

550 

Do 

053 

42b 

bl5 

923 

038 

418 

614 

925 

045 

157 

241 

471 

D(  .8) 

056 

424 

622 

922 

046 

412 

616 

922 

045 

155 

252 

4o3 

B(  .8) 

05b 

425 

602 

912 

048 

402 

608 

922 

059 

159 
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497 

C.  LARGE  PARALLEL  DESIGN 


X 

LS 

D0 

d(  .8) 


0.0 

2.5 

5.0 

7.2 

053 

562 

854 

951 

054 

533 

835 

933 

051 

531 

835 

937 
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530 

832 
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0.0 

2.8 

6.2 

10.1 

043 

655 
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994 

044 

621 

919 
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047 
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918 
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049 

607 

912 
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5.5 
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14.0 

057 
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944 

995 

056 

790 

933 

993 

056 

783 

934 

991 

063 

773 

936 

989 

B(  .8) 


— 

TABLE  4.3  EMPIRICAL  POWERS:  Contaminated  Normal  Distribution 


A. 

2x2 

UNBALANCED 

DESIGN 

U01 

H02 

HQ3 

A 

0.0 

2.3 

6.9 

12.3 

0.0 

2.3 

6.9 

12.3 

0.0 

3.4 

10.0 

28 

IS 

026 

183 

350 

497 

035 

186 

339 

478 

041 

221 

423 

707 

Do 

044 

392 

699 

863 

046 

382 

677 

865 

042 

422 

804 

976 

DC. 8) 

041 

387 

682 

858 

042 

368 

663 

852 

040 

414 

793 

972 

B(  .8) 
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364 

664 

858 

043 

353 

661 

862 

044 
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799 

978 

B. 

SMALL 

PARALLEL  DESIGN 

H01 

H02 

K03 

A 

0.0 

2.0 
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7.3 

0.0 

1.9 

3.3 

7.3 

0.0 
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8.2 

LS 

024 
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038 
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39  3 

027 
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327 

D0 

040 
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369 

673 

038 

264 
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029 
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584 

D(  .8) 

036 
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367 
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038 
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653 

028 

181 

353 

580 

B(  .8) 

033 
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352 

663 

040 

242 

362 

640 

041 
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358 

581 

C. 

LARGE 

PARALLEL  DESIGN 

H01 

02 

H03 

A 
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5.0 

7.2 

0.0 

2.8 

6.2 

10. 1 
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5.5 
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14. 

LS 
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246 
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033 
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029 

208 
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436 

Do 

050 

308 

551 

7  34 

048 

396 

687 

875 

046 

503 

705 

865 

DC  .8) 

048 

298 

552 

730 

046 

387 

686 

871 

044 

501 

710 

865 

BC.8) 

047 

295 

544 

732 

039 

341 

661 

845 

045 

490 

716 

876 

27 


TABLE  4.4  EMPIRICAL  POWERS:  Skewed  Contaminated  Normal  Distribution 

A.  2x2  UNBALANCED  DESIGN 


H01  HQ2  H03 


X 

0.0 

3.0 

6.9 

12. 

0.0 

3.0 

6.9 

12. 

0.0 

1.7 

3.4 

10. 

LS 
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Do 
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818 

D(  .8) 

049 

400 

733 
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059 
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041 
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711 

038 

217 
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404 

708 

041 
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404 

616 
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C.  LARGE  PARALLEL  DESIGN 
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14.0 

LS 

045 
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709 

885 

046 
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TABLE  4.5  EMPIRICAL  POWERS:  Lognormal  Distribution 

A.  2x2  UNBALANCED  DESIGN 


H01  H02  H03 


A 

0.0 

1.3 

3.0 

6.9 

0.0 

1.3 

3.0 

6.9 

0.0 

1  .7 

3.4 

10. 

LS 
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329 
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039 
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°o 
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260 
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055 
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042 
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057 
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061 
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B .  SMALL  PARALLEL  DESIGN 


V  »02  H03 


A 
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2.0 
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0.0 

.9 

1  .6 

3.4 

LS 

044 
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047 

240 
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038 

090 
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274 

D0 

048 

345 

480 

775 

040 

330 

501 

784 

042 

120 

183 

369 

D(  .8) 

060 

430 

593 

848 

045 

430 

615 

853 

051 

152 

239 

475 

B(  .8) 

067 

419 
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050 

408 

612 

867 

073 

173 

244 

475 

C.  LARGE  PARALLEL  DESIGN 


H01  H02  H03 


0.0 

2.5 

5.0 

7.2 

0.0 

2.8 

6.2 

10.1 

0.0 

5.5 

8.8 

U.O 

03b 

265 

441 

559 

045 
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485 

655 

036 

374 

521 

689 

030 

470 

753 

888 

039 

497 

828 

938 

027 

664 

836 

953 

050 

613 

850 

937 

064 

649 

895 

969 

049 

812 

921 

976 

055 

612 

870 

949 

065 

658 

906 

980 

062 

832 

9  38 

991 

B(  .8) 


Figure  4.1.  G  (t)  and  G  (t)  are  the  empirical  cumulative  distribution 
n  n 

functions  of  the  absolute  differences  of  R-  residuals  and  random  errors, 

respectively,  from  a  simulation  of  lognormal  variables  with  design  A  . 

For  the  slopes,  y  is  from  (3.5)  with  a  ■  .80  ,  y  is  similar  but  based 

*  “ 

on  the  random  errors  and  y  corrects  the  y  as  in  (4.1). 
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